Video processing method and apparatus, device, storage medium and computer program product

ABSTRACT

Embodiments of the present disclosure provide a video processing method and apparatus, a device, a storage medium and a computer program product, where the method includes: playing an audio in a process of capturing a video; playing a first part of the audio at a changing speed according to a first speed change mode in response to recognizing that a target object presents a target state in a captured video image; obtaining a target video according to the captured video image and the audio; and playing, in the target video, a video image captured before the target object presents the target state at a constant speed, and playing, in the target video, a video image captured after the target object presents the target state at a changing speed according to a second speed change mode, where the audio is played at a constant speed.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of International Application No.PCT/CN2021/138507, filed on Dec. 15, 2021, which claims a priority ofChinese Pat. Application No. 202011508607.2, filed on Dec. 18, 2020, andentitled “Video Processing Method and Apparatus, Device, Storage Mediumand Computer Program Product”. Both of the aforementioned patentapplications are incorporated herein by reference in their entireties.

TECHNICAL FIELD

Embodiments of the present disclosure relate to the technical field ofcomputers, and in particular, to a video processing method andapparatus, a device, a storage medium and a computer program product.

BACKGROUND

A video recording function is a common function of a video camera or avideo recording application of a client. With the development oftechnology, an application of the video recording function is becomingmore and more widespread.

At present, the video recording function can make various kinds ofvideos, such as making slow-motion videos or fast-motion videos, etc.However, making various kinds of videos is based on that a section of acaptured video is imported into a video editing software or an editingpage for a corresponding slow playback process or a corresponding fastplayback process, and then a slow-motion video playback or a fast-motionvideo playback is completed.

Therefore, the video speed change recording mode in the prior art hascomplicated operating steps and complicated processing procedures, andcannot provide convenient operation for a user, thus affecting the userexperience.

SUMMARY

Embodiments of the present disclosure provide a video processing methodand apparatus, a device, a storage medium and a computer programproduct, which are capable of providing a convenient operation for auser, and the processing procedure is simple, so as to meet the userrequirements and improve the operation experience of the user.

In a first aspect, an embodiment of the present disclosure provides avideo processing method, and the method includes:

-   playing an audio in a process of capturing a video;-   playing a first part of the audio at a changing speed according to a    first speed change mode in response to recognizing that a target    object presents a target state in a captured video image; where the    first part of the audio represents an unplayed part of the audio    when it is recognized that the target object presents the target    state in the captured video image;-   obtaining a target video according to the captured video image and    the audio;-   playing, in the target video, a video image captured before the    target object presents the target state at a constant speed, and    playing, in the target video, a video image captured after the    target object presents the target state at a changing speed    according to a second speed change mode, where the audio is played    at a constant speed; and-   where a speed change direction of the first speed change mode is    opposite from a speed change direction of the second speed change    mode and a changing speed of the first speed change mode is the same    as a changing speed of the second speed change mode.

In a second aspect, an embodiment of the present disclosure provides avideo processing apparatus, and the apparatus includes:

-   a capturing module, configured to play an audio in a process of    capturing a video;-   a processing module, configured to play a first part of the audio at    a changing speed according to a first speed change mode in response    to recognizing that a target object presents a target state in a    captured video image; where the first part of the audio represents    an unplayed part of the audio when it is recognized that the target    object presents the target state in the captured video image;-   the processing module is further configured to obtain a target video    according to the captured video image and the audio; and-   a playing module, configured to play, in the target video, a video    image captured before the target object presents the target state at    a constant speed, and play, in the target video, a video image    captured after the target object presents the target state at a    changing speed according to a second speed change mode, where the    audio is played at a constant speed; and-   where a speed change direction of the first speed change mode is    opposite from a speed change direction of the second speed change    mode and a changing speed of the first speed change mode is the same    as a changing speed of the second speed change mode.

In a third aspect, an embodiment of the present disclosure provides anelectronic device, including:

-   at least one processor; and memory;-   the memory stores computer-executed instructions; and-   the at least one processor executes the computer-executed    instructions stored in the memory to cause the at least one    processor to execute the video processing method as described in the    above first aspect.

In a fourth aspect, an embodiment of the present disclosure provides acomputer readable storage medium storing computer-executed instructionswhich, when executed by a processor, implement the video processingmethod as described in the above first aspect.

In a fifth aspect, an embodiment of the present disclosure provides acomputer program product including a computer program which, whenexecuted by a processor, implements the video processing method asdescribed in the above first aspect.

In a sixth aspect, an embodiment of the present disclosure provides acomputer program which, when executed by a processor, implements thevideo processing method as described in the above first aspect.

The video processing method and apparatus, the device, the storagemedium and the computer program product provided by the embodiments ofthe present disclosure allow for: playing an audio in a process ofcapturing a video, then playing a first part of the audio at a changingspeed according to a first speed change mode in response to recognizingthat a target object presents a target state in a captured video image,where the first part here represents an unplayed part of the audio whenit is recognized that the target object presents the target state in thecaptured video image; then obtaining a target video according to thecaptured video image and the audio, playing, in the target video, avideo image captured before the target object presents the target stateat a constant speed, and playing, in the target video, a video imagecaptured after the target object presents the target state at a changingspeed according to a second speed change mode, where the audio is playedat a constant speed, and where a speed change direction of the firstspeed change mode is opposite from a speed change direction of thesecond speed change mode and a changing speed of the first speed changemode is the same as a changing speed of the second speed change mode.Therefore, it is ensured that the audio is played at a constant speed inthe target video, while a part of the video image is played at aconstant speed and another part of the video image is played at achanging speed, thereby realizing the speed change recording and playingof the captured video. The present application can start the operationof controlling the video speed change recording directly by recognizingthe target state presented by the target object, without the need toimport the captured video into the video editing software or the editingpage for a corresponding slow playback process or a fast playbackprocess. In order to improve the user experience and make a user have astrong perception for the speed change recording, after the speed changerecording is triggered in the video capturing process, the played audiois played at a changing speed according to the first speed change mode,while in the process of playing the target video generated aftercapturing, for the video captured after the speed change is triggered,the video image is played at a changing speed according to the secondspeed change mode, while the audio is played at a constant speed. Thus,the personalized needs of users are satisfied, and the processingprocedure is simple and the operation steps are simplified, causing theuser to realize the speed change processing of the video throughconvenient operation, thereby improving the user experience.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to explain the embodiments of the present disclosure or thetechnical solutions in the prior art more clearly, the drawings thatneed to be used in the description of the embodiments or the prior artwill be briefly introduced in the following. Obviously, the drawings inthe following description are some embodiments of the presentdisclosure, and for those of ordinary skilled in the art, other drawingscan be obtained according to these drawings without paying creativeeffort.

FIG. 1 is a schematic diagram of a capturing interface provided by anembodiment of the present disclosure.

FIG. 2 is a schematic diagram of a scenario for a video processingmethod provided by an embodiment of the present disclosure.

FIG. 3 is a schematic diagram of a scenario for a video processingmethod provided by another embodiment of the present disclosure.

FIG. 4 is a flow chart of a video processing method provided by yetanother embodiment of the present disclosure.

FIG. 5 is a flow chart of a video processing method provided by anembodiment of the present disclosure.

FIG. 6 is a schematic diagram of a scenario for a video processingmethod provided by a further embodiment of the present disclosure.

FIG. 7 is a structural block diagram of a video processing apparatusprovided by an embodiment of the present disclosure.

FIG. 8 is a schematic structural diagram of hardware of an electronicdevice provided by an embodiment of the present disclosure.

DESCRIPTION OF EMBODIMENTS

Embodiments of the present disclosure will be described in greaterdetail below with reference to the drawings. Although some embodimentsof the present disclosure are shown in the drawings, it should beunderstood that the present disclosure may be implemented in variousforms and should not be construed as limited to the embodiments setforth herein. Rather, these embodiments are provided for a more thoroughand complete understanding of the present disclosure. It should beunderstood that the drawings and embodiments of the present disclosureare for illustrative purposes only and are not intended to limit thescope of protection of the present disclosure.

It should be understood that steps described in the methodimplementations of the present disclosure can be executed in differentorders or in parallel. In addition, the method implementations mayinclude additional steps and/or omit the shown steps. The scope of thepresent disclosure is not limited in this respect.

As used herein, the term “include” and its variations are open-endedinclusion, i.e., “including but not limited to”. The term “based on”means “based at least in part”. The term “one embodiment” means “atleast one embodiment”; the term “another embodiment” means “at least oneother embodiment”; and the term “some embodiments” means “at least someembodiments”. Relevant definitions of other terms will be given in thefollowing description.

Note that the concepts “first”, “second”, etc., mentioned in the presentdisclosure are used only to distinguish between different apparatuses,modules or units, and are not intended to limit the order orinterdependence of the functions performed by these apparatuses, modulesor units.

It should be noted that the modifications of “one” and “more than one”mentioned in the present disclosure are illustrative but notrestrictive, and it should be understood by those skilled in the artthat, they should be understood as “one or more” unless otherwiseclearly indicated in the context.

The names of messages or information interchanged between a plurality ofapparatuses or modules in the embodiments of the present disclosure arefor illustrative purposes only and are not intended to limit the scopeof those messages or information.

At present, the video recording function can make various kinds ofvideos, such as making slow-motion videos or fast-motion videos, etc.However, making various kinds of videos is based on that a section ofthe captured video is imported into a video editing software or acorresponding slow playback process or a corresponding fast playbackprocess is performed on an editing page, and then making a video iscompleted, thereby achieving a slow-motion video playback or afast-motion video playback. In addition, in the prior art, there areplatforms with fast and slow speed capturing functions, but the fast andslow speed herein is only processed and presented to a user in real timeduring capturing, and the requirements for real-time processing aredifficult and the recording time is short, usually 1s. Therefore, thevideo speed change recording mode in the prior art has complicatedoperating steps and complicated processing procedures, and cannotprovide convenient operation for a user, thus affecting the userexperience.

In order to solve the above problems, the technical concept of thepresent disclosure is that a speed change triggering operation can beconfigured, i.e., by recognizing the speed change triggering operationand an acquired second speed change mode for speed change recording,firstly, based on the second speed change mode, a first speed changemode matching the second speed change mode (i.e., a speed changedirection of the first speed change mode is opposite from a speed changedirection of the second speed change mode and a speed change speed ofthe first speed change mode is the same as a speed change speed of thesecond speed change mode) is acquired, then the video image capturedafter the target object recognized in the captured video image presentsa target state is played at a changing speed according to the secondspeed change mode, which is used to prompt a user to start a speedchange processing, and then a part of the video image in the targetvideo generated by the video image and the audio is played at a constantspeed and a part of the video image is played at the second speed changemode, while ensuring that the audio in the target video is played at aconstant speed, thereby completing the speed change processing for thevideo. The processing procedure is simple and the operation steps aresimplified, causing the user to realize speed change recording throughconvenient operations, thereby improving the user experience.

The technical solutions of the present disclosure will be explained indetail in the following specific embodiments. The following specificembodiments can be combined with each other, and the same or similarconcepts or processes may not be repeated in some embodiments.

In practical applications, the execution subjects of the embodiments ofthe present disclosure may be a terminal device, such as a mobileterminal, a computer device (e.g., a desktop computer, a notebookcomputer, an all-in-one computer, etc.), and the mobile terminal mayinclude a mobile device with a playing function, such as a smart phone,a handheld computer, a tablet computer, a wearable device with a displayscreen, etc. In practical applications, a displayed user interface, suchas a capturing interface, can be used to capture or record a targetobject, where the target object can be a person, a scene, etc., and theplaying interface can be used to play a target video, and the targetvideo can be an audio and video processed by speed change recording,which is not specifically limited by the present disclosure.

In a scenario, referring to FIG. 1 , FIG. 1 is a schematic diagram of acapturing interface provided by an embodiment of the present disclosure,where a terminal device can display a capturing interface 10 forcapturing a target object, and the capturing interface 10 is providedwith a capturing button 101, a capturing duration switching button 102,a special effect editing button 103, a selecting music button 104, andan album button 105, and may also include other capturing functiondisplay area 106. Where the capturing operation of the target object canbe triggered by clicking the capturing button 101. Those skilled in theart should understand that the positions and the sizes of the capturingbutton 101, the capturing duration switching button 102, the specialeffect editing button 103, the selecting music button 104 and the othercapturing function display area 106 can be set according to actualrequirements. Where the capturing button 101 is configured to provide acapturing start function for a user; the capturing duration switchingbutton 102 is configured to provide a capturing duration switchingfunction for a user; the special effect editing button 103 is configuredto provide a special effect for a user, where a default music can becontained in the special effect, the target state presented by acaptured object (i.e., the target object) can be recognized through thespecial effect editing effect, so as to trigger a speed changeprocessing operation; the selecting music button 104 can provide a musiclibrary for a user; and the album button 105 is configured to providecapturing materials for a user.

The video processing method according to the embodiments of the presentdisclosure can be applied to capturing or recording scenarios, which arespecifically video speed change recording scenarios, for example,triggering any configured special effect editing effect is supported,where the special effect editing effect is used to support the functionof controlling video speed change recording; and can recognize whetherthe target state is presented in the captured video image, if the targetstate is recognized during the capturing or recording procedure, i.e.,the triggering operation for controlling video speed change recording isdetected during the capturing or recording procedure, speed changeprocessing, such as speed change processing for a video image or speedchange playing for an audio during capturing, can be realized, , withoutcomplicated operating steps. The present disclosure does not limitapplication scenarios.

The technical solutions of the present disclosure will be explained indetail in the following specific embodiments. The following specificembodiments can be combined with each other, and the same or similarconcepts or processes may not be repeated in some embodiments. Theterminal device can display a plurality of functions in the functiondisplay area in the capturing interface 10, such as a flip, a filter, abeautification, a countdown, a special effect editing effect, an album,selecting music, a capturing duration, etc., where the special effectediting effect herein can include a special effect editing effect usedto support the control of video speed change recording function, such asa predefined sticker, in which the special effect can include therequired speed change processing rate, recording at the required ratecan realize speed change recording, and then the speed change mode(including speed change direction and speed change speed) configured bythe special effect editing effect can be obtained by recognizing thatthe target object presents the target state; and selectable speed andspeed change direction (fast or slow speed) for speed change processingcan also be provided to a user through the capturing interface, and thenthe target state presented by the target object is recognized throughthe recognizable speed change trigger mode configured based on the speedchange mode selected by the user in combination with the special effectin the special effect editing effect.

The special effect editing effect configured by the embodiments of thepresent disclosure can control speed change recording by recognizingtrigger actions or specific objects. Take a sticker as an example, thesticker can be configured as follows: the sticker starts capturing andresets; for the sticker, recording is capable of being valid, but thepreview is invalid; the corresponding speed change rate of the stickeris configured in the resource package; the speed change playing eventand speed change rate should be transmitted to a video expression (VE)software development kit (SDK) and the client in real time; a speedchange recording message interface is added, which requires the slowestcontrol within 10 frames; upon making a sticker, speed up and add aframe properly, and slow down after entering the editing page to becomenormal speed, and if a slow down effect following the image is required,the normal speed of the sticker is feasible, and afterwards, particleeffect, physical effect (cloth, feather, etc.) can only be slowed down;the speed change playing event and the speed change rate are transmittedto the VE and the client in real time, a new interface may be neededherein; where this sticker is forbidden to be duetted; the sticker doesnot record the original sound; and the sticker needs to disable afunction of adjusting the rate, which are not limited by the presentdisclosure.

Specifically, for example, a captured object is taken as a character,and the special effect editing effect itself is configured with a speedchange trigger special effect and a speed change mode (it can be thatone special effect editing effect corresponds to one speed changetrigger special effect, and one special effect corresponds to one speedchange mode; it can also be that one special effect editing effectcorresponds to a plurality of speed change trigger special effects, andone special effect corresponds to one speed change mode; it can also bethat one special effect editing effect corresponds to a plurality ofspeed change trigger special effects, and a plurality of special effectscorrespond to one speed change mode; and it can also be that a pluralityof special effect editing effects correspond to one speed change triggerspecial effect, one special effect corresponds to one speed change mode,etc., and there is a mapping relationship between a special effect and aspeed change mode, which is not specifically limited herein), thecharacter is displayed on the capturing interface 10. On the capturinginterface 10, the user can select a capturing duration, such as 15 sshown in FIG. 1 , which means that the current capturing interfaceallows the longest capturing duration of 15 s. The user clicks thespecial effect editing button 103 (see (a) of FIG. 2 ) to display thespecial effect editing effect display field 1031, including a pluralityof stickers (see (b) of FIG. 2 ), selects a predefined sticker in thespecial effect editing effect, such as a sticker 1. And then theposition of the special effect editing button is replaced with thesticker 1 (see (c) of FIG. 2 ), and the background image correspondingto the sticker 1 is displayed on the capturing interface. The user canalso click the above other functions at the same time, for example,clicking the selecting music button 104 (see (a) of FIG. 3 ) to displaythe selecting music display interface 1041 (see (b) of FIG. 3 ), and theuser can search the target music from the search field and confirm theuse, such as music 1 (see (c) of FIG. 3 ).

It should be noted that the operation sequence shown in (b) of FIG. 2and (a) of FIG. 3 is not particularly limited. On the basis of (c) ofFIG. 2 and (c) of FIG. 3 , the user can turn on the capturing functionby clicking the capturing button 101 in the capturing interface (see (a)of FIG. 4 )). After the capturing function is turned on, the terminaldevice starts capturing the target object, and a progress bar 110 ofcapturing time and an ending capturing button 107 are displayed on thecurrent capturing interface, and the functional modules are hidden. Atthe same time, the capturing button 101 is updated to the capturingstate, i.e., the capturing button 101 is switched to a capturing pausebutton 1011 (see (v) of FIG. 4 ). When a special effect (such as asmile) is detected in the capturing procedure, the speed changerecording function can be triggered (take slow recording as an example,see (c) of FIG. 4 ). In the capturing procedure, the video image isplayed at a constant speed, and at the same time, the audio is played ata rate opposite to that required by the special effect, and when thevideo is played after the capturing is complete, the captured videoimage after the acquired triggered speed change operation is playedaccording to the speed change mode required by the characteristics, andthe audio is played at a constant speed, thus ensuring that the audioplayed when the complete captured video is played is at a normal speed.

Take a special effect of the special effect editing effect as a smile asan example, if a smile action is detected in the capturing procedure,the operation for controlling video speed change recording is triggered,and the required speed change recording rate of the smile special effect(for example, 2x slow) is obtained, and at the same time, the audiotriggered by the special effect will be processed and played in thebackground (here referred to as an audio and video terminal) at 2x fast.At this time, in order to ensure that the audio played during playingafter recording is not interfered by the original sound played and/orrandomly appeared before the speed change recording, captured originalsound, such as ambient sound and played music original sound, is notrecorded during the audio processing. It should be noted that, duringthe recording procedure, the speed change recording can be imperceptibleto the user, and the video image and audio can be played at a normalspeed, or in order to satisfy the user experience, for the speed changerecording, the captured video image is played at a constant speed duringthe capturing procedure, and the unplayed part of the audio after thetrigger speed change operation is turned on is played according to thesecond speed change mode that matches the first speed change moderequired by the special effect. After the capturing is ended, the audioand video recorded before the special effect appears is played at anormal speed, and after the special effect triggering operation (i.e.,speed change triggering operation) is detected, the video is played at arequired rate of the special effect and the audio is played at a normalspeed, so that the video speed change recording processing is realized,convenient operation is provided for a user, the processing procedure issimple, thus the user requirements are satisfied, and the operationexperience of the user is improved.

The constant speed of video herein refers to the capturing speed, andthe constant speed of audio refers to its own speed. Applicationscenarios are not limited by the present disclosure.

In addition, it should be understood by those skilled in the art thatthe “trigger” operation can include (but is not limited to): a clicktrigger, i.e., the user clicks the corresponding button or any positionin the interface to realize the trigger; a gesture trigger, i.e., theuser realizes a trigger operation by making a corresponding gesture; avoice trigger, i.e., the user realizes a triggering operation by issuinga corresponding voice instruction; an expression trigger, i.e., the userrealizes a triggering operation by making a corresponding expression,etc., the present disclosure does not limit the triggering operation.

In an embodiment, the video processing method can be implemented in thefollowing ways.

Referring to FIG. 5 , FIG. 5 is a flow chart of a video processingmethod provided by an embodiment of the present disclosure. The videoprocessing method can include the following.

S101: play an audio in a process of capturing a video.

S102: play a first part of the audio at a changing speed according to afirst speed change mode in response to recognizing that a target objectpresents a target state in a captured video image; where the first partof the audio represents an unplayed part of the audio when it isrecognized that the target object presents the target state in thecaptured video image.

The first speed change mode can be matched with the recognized targetstate. For example, if a smile triggers a speed change processingoperation and the speed change mode is a 2x slow playback, the firstspeed change mode is 2x fast; and it can also be selected and input bythe user before capturing the video, which is not specifically limitedherein.

The method of the embodiments of the present disclosure can be appliedto a terminal device, and the terminal device can display a capturinginterface, which is a user interface for capturing a target object. Thecapturing interface includes a capturing area, a function display areaand a capturing button, and a special effect editing effect module canbe displayed in the function display area. By clicking the specialeffect editing button corresponding to the special effect editingmodule, a plurality of special effect editing effects can be displayedto the user in the capturing interface, where the plurality of specialeffect editing effects can include special effect editing effects forsupporting video speed change recording function. Where the displaymodes of the plurality of special effect editing effects are notspecifically limited.

Specifically, the user can start the capturing function by clicking thecapturing button through the terminal device, and play a default audioor a special effect editing effect self-audio or an audio selected bythe user in the process of capturing a video. In the capturingprocedure, the terminal device detects whether there is an operation forcontrolling the video speed change processing in real time, i.e.,recognizes whether the target object presents a target state, and if so,responds to the recognized target state. Firstly, when it is recognizedthat a target object presents a target state in a captured video image,a speed change mode of speed change playback required for the unplayedpart of the audio, i.e., a first speed change mode, is obtained, wherethe first speed change mode can be a mode matched with a recording moderequired by a special effect, for example, if the recording and playingmode required by the special effect is 2x slow playback, then the firstspeed change mode is 2x fast playback; and it can also be the mode ofuser input. The target state may include an interactive action and/or apreset object contained in the captured video, where the interactiveaction may include face and limb actions, such as smiling, blinking,raising a hand, lifting a leg, etc., and the recognized target objectsare preset objects such as clothes, cars, buildings, etc., which are notlimited by the present disclosure.

In practical applications, before the speed change trigger, thecapturing interface needs to be displayed and the capturing function isturned on, before this, the user is required to select the specialeffect corresponding to a speed change recording, such as a specialeffect editing effect. Specifically, the terminal device can display thecapturing interface and turn on the capturing function through thefollowing steps:

Step a1: display the capturing interface for capturing the targetobject, where the capturing interface includes a special effect editingbutton for supporting and controlling video speed change recording.

Step a2: in response to a first triggering operation acting on thespecial effect editing button, upon receiving an instruction forinstructing to perform a capturing operation on the capturing interface,perform a first capturing operation, and detect whether the targetobject presents the target state while capturing the target object inthe capturing interface, where the target state includes a state inwhich the target object presents a target action and/or a preset objectis presented in the target object.

Step a3: if it is recognized that the target object presents the targetstate in the video capturing procedure, execute an operation ofcontrolling video speed change recording.

In the embodiments of the present disclosure, when a usercapturing/recording a video (hereinafter referred to as the user) opensan application program, the terminal device can display a plurality ofswitching interfaces, and the user can click a switching button matchingwith the capturing interface to jump and display the capturing interface10. Where the capturing interface can include a special effect editingbutton for supporting and controlling video speed change recording,which can specifically be a sticker, or it can also be that one stickercorresponds to turning on a recording function with one speed; it canalso be that one sticker corresponds to recording functions withmultiple speeds. Therefore, the recording function of the correspondingspeed can be triggered according to the action of the target object orwhat the target object is. For example, smiling corresponds to 2x slowrecording, waving corresponds to 2x fast recording, etc., or a specialeffect in a sticker can be used to trigger the speed change processingoperation, and through selecting other special effect editing effects(such as a speed change mode special effect editing effect) before thevideo is captured, a user may perform a corresponding speed changeprocessing in accordance with the acquired speed change mode input bythe user upon recognizing the present target state during capturing,which is not limited by the present disclosure.

Taking the special effect in the special effect editing effect beingconfigured with speed change mode as an example, where, in the capturinginterface, before capturing is turned on, the user can click the specialeffect editing button to select the special effect editing effect usedto support and control video speed change recording from the displayarea of the special effect editing effect, where, if there are multiplespecial effect editing effects used to support and control video speedchange recording, the target special effect editing effect correspondingto the desired capturing speed can be selected from the special effectediting effect, and then the corresponding target triggering operationcan be made based on the special effect in the special effect editingeffect; and if the special effect editing effect used to support andcontrol video speed change recording is one, the corresponding targettriggering operation can be made based on the recognizable specialeffect in the special effect editing effect.

Specifically, after the user selects the special effect editing effect,the terminal device displays the background environment of the specialeffect editing effect on the capturing interface in response that thetriggering operation acting on the special effect editing button, i.e.,the first triggering operation. When the user clicks the capturingbutton to start capturing, an instruction for instructing to perform thecapturing operation on the capturing interface is generated, and theterminal device receives the instruction and starts the capturingoperation, i.e., the first capturing operation. In the capturingprocedure, it is detected whether a target triggering operation (forexample, an interactive action and/or a target object such as smiling,waving or a target building) exists in the captured video in real time.If existing, the client in the terminal device will send a speed changerecording message corresponding to the target triggering operation to anaudio and video terminal in the terminal device for speed changerecording and saving of the current music.

S103: obtain a target video according to the captured video image andthe audio.

S104: play, in the target video, a video image captured before thetarget object presents the target state at a constant speed, and play,in the target video, a video image captured after the target objectpresents the target state at a changing speed according to a secondspeed change mode, where the audio is played at a constant speed; wherea speed change direction of the first speed change mode is opposite froma speed change direction of the second speed change mode and a changingspeed of the first speed change mode is the same as a changing speed ofthe second speed change mode.

In this embodiment, the captured video image and audio are synthesizedto obtain the audio and video to be played, i.e., the target video.Then, after the capturing is complete, the terminal device plays thetarget video. In the process of playing the target video, the videoimage captured before the target object presents the target state isplayed at a constant speed and the video image captured after the targetobject presents the target state is played at a changing speed accordingto the second speed change mode, while the audio is played at a constantspeed.

The constant speed of the video herein refers to the capturing speed,and the constant speed of the audio refers to its own speed. Here, thespeed change direction the second speed change mode is opposite to thespeed change direction of the first speed change mode. For example, ifthe first speed change mode is 2x slow, the second speed change mode is2x fast. The purpose may be to play the music at a normal speed when thetarget video is played after the capturing is complete, so as to improvethe user’s viewing experience. In order to ensure that the audio playedduring playing after the capturing is complete is not interfered by theoriginal sound played/or appeared randomly before the speed changerecording, the captured original sound is not recorded during the audioprocessing.

In a possible design, the target state has a mapping relationship withthe first speed change mode; or, the second speed change mode isdetermined in response to a setting operation of a user, and the firstspeed change mode is determined based on the second speed change mode.

In this embodiment, in view of the mapping relationship between thetarget state and the first speed change mode, it may be that the specialeffect editing effect itself defaults to the type and/or the speed ofthe speed change, therefore, the target state recognized by the specialeffect editing effect can determine its corresponding speed change typeand/or speed. It is also possible to recognize a special effect throughthe special effect editing effect, and different actions in specialeffects trigger different types of speed changes, or even differentspeed changes, so as to determine the type and/or the speed of the speedchange by the target state.

How to play an unplayed part of the audio at a changing speed accordingto a first speed change mode in response to recognizing that a targetobject presents a target state in a captured video image can be realizedby the following steps.

Step b1: determine the first speed change mode having the mappingrelationship with the target state in response to recognizing that thetarget object presents the target state in the captured video image.

Step b2: play the first part of the audio at a changing speed accordingto the first speed change mode.

In this embodiment, in response the recognized target state, the firstspeed change mode matching the target state is obtained through themapping relationship, and then the first part of the audio is played ata changing speed based on the first speed change mode.

Since the second speed change mode is determined in response to asetting operation of a user, and the first speed change mode isdetermined based on the second speed change mode, the first speed changemode may be determined through the user by pushing a slider to setdifferent speeds and selecting different types of speed changes throughbuttons. Here, there is no specific limitation on that how the userselects or sets different speed change modes, and the speed change modeincludes a type of the speed change and/or a speed of the speed change.

Specifically, the second speed change mode can be a speed change moderequired by the special effect, for example, if the speed change moderequired by the special effect is 2x slow playback, the first speedchange mode is 2x fast playback, and the second speed change mode is 2xslow playback; and can also be a mode input by the user. Specifically,after the capturing is complete, it jumps to display the playinginterface, the video after speed change processing (i.e., the targetvideo) is played through the playing interface, where the target videohere includes a video image captured before the target object presentsthe target state and speed changing is performed, the video imagecaptured after the target object presents the target state and playedaccording to the second speed change mode and the audio played at aconstant speed. Where the video image played according to the secondspeed change mode can be processed by adding or subtracting a frame.

In addition, when the capturing function is turned on, the terminaldevice starts capturing the target object and plays the currentlycaptured video, where the played content here includes the video imageupon capturing the target object and the music played in the capturingprocedure. Until the speed change is triggered, the speed changeprocessing starts, and the first part of the audio is played at achanging speed according to the first speed change mode, where the firstpart of the audio here represents an unplayed part of the audio when itis recognized that the target object presents the target state in thecaptured video image. Where the starting time of the first part of theaudio is a time when the capturing interface captures the target object(i.e., a time when the speed change processing is started) in responseto recognizing that the target object presents the target state in thecaptured video image; and the termination time of the first part of theaudio is a time in response that the capturing is complete, which can betriggered by the user or a time of arrival of the capturing time. Forexample, when the start time of the first part of the audio is thecapturing start time, it means that the target state is immediatelydetected when the capturing function is started, therefore, the videoimage played after the capturing is complete is a video image with speedchange at the beginning.

Please refer to FIG. 6 for how to display the playing interface, andFIG. 6 is a schematic diagram of a scenario for a video processingmethod provided by a further embodiment of the present disclosure.Before playing the target video, the method may further include:

-   jumping to display a playing interface in response to a second    triggering operation for indicating to end capturing, where the    playing interface is used to play the target video; or,-   jumping to display the playing interface after a capturing time in    the capturing interface is over.

In the embodiments of the present disclosure, in the capturingprocedure, the user can click an ending capturing button 107 (see (a) ofFIG. 6 ), and in response to the click operation, i.e., the secondtriggering operation, the terminal device jumps from the capturinginterface 10 to the playing interface 20 and displays the playinginterface 20 (see (b) of FIG. 6 ). If the operation of the endingcapturing button 107 triggered by the user is not detected in thecapturing procedure, when the capturing time in the capturing interfaceis complete, for example, the capturing progress bar on the capturinginterface reaches the ending position (see (c) of FIG. 6 ), it willautomatically jump from the capturing interface 10 to the playinginterface 20 and display the playing interface 20 (see (b) of FIG. 6 ).The present disclosure does not specifically limit the mode of switchingand jumping between the capturing interface and the playing interface.

Taking the smile action as the target triggering operation (i.e. thetarget state), and the smiling action to represent a 2x slow recordingtriggering action as an example, the special effect editing effectcorresponding to the smile action is first added, and the music can alsobe selected or the background music of the special effect editing effectcorresponding to the smile action can be taken as the captured music,and the terminal device starts to detect the target triggeringoperation. Specifically, it is determined whether there is a smileaction, if there is a smile action, it is determined whether the currentsmile action occurs in the capturing, if the current smile action occursin the capturing, it is determined whether the speed change state istriggered, if so, a message corresponding to the rate required by thesmile action, i.e., the message recorded at 2x slow speed, is sent tothe audio and video terminal VE, and the VE perform a fast recording andsaving on the default music (the music currently played on the capturinginterface), the fast recording here can be recorded at 2x fast speed. Atthe same time, the original sound is not recorded, and then the recordedtarget audio is sent to the client, and the client plays the targetaudio at a 2x slow speed, which shortens the total time, and ends therecording and playing.

According to the present disclosure, the capturing speed can becontrolled only by smiling at the camera without the user operating thescreen. At the same time, there is no need to import the captured videointo other editing pages or software, the processing procedure issimple, and the user experience is improved. It should be noted thatsmile recognition is taken as an example to clearly explain thetriggering and processing procedure of speed change recording, but thepresent disclosure does not limit interactive behavior, includingraising a hand, gagging, shaking one’s head, etc.

The video processing method provided by the embodiments of the presentdisclosure allow for: playing an audio in a process of capturing avideo, then playing a first part of the audio at a changing speedaccording to a first speed change mode in response to recognizing that atarget object presents a target state in a captured video image, wherethe first part here represents an unplayed part of the audio when it isrecognized that the target object presents the target state in thecaptured video image; then obtaining a target video according to thecaptured video image and the audio, playing, in the target video, avideo image captured before the target object presents the target stateat a constant speed, and playing, in the target video, a video imagecaptured after the target object presents the target state at a changingspeed according to a second speed change mode, where the audio is playedat a constant speed, and where a speed change direction of the firstspeed change mode is opposite from a speed change direction of thesecond speed change mode and a changing speed of the first speed changemode is the same as a changing speed of the second speed change mode.Therefore, it is ensured that the audio is played at a constant speed inthe target video, while a part of the video image is played at aconstant speed and another part of the video image is played at achanging speed, thereby realizing the speed change recording and playingof the captured video. The present application can start the operationof controlling the video speed change recording directly by recognizingthe target state presented by the target object, without the need toimport the captured video into the video editing software or the editingpage for a corresponding slow playback process or a fast playbackprocess. In order to improve the user experience and make a user have astrong perception for the speed change recording, after the speed changerecording is triggered in the video capturing process, the played audiois played at a changing speed according to the first speed change mode,while in the process of playing the target video generated aftercapturing, for the video captured after the speed change is triggered,the video image is played at a changing speed according to the secondspeed change mode, while the audio is played at a constant speed. Thus,the personalized needs of users are satisfied, and the processingprocedure is simple and the operation steps are simplified, causing theuser to realize the speed change processing of the video throughconvenient operation, thereby improving the user experience.

In an embodiment of the present disclosure, on the basis of the aboveembodiments, the target video can be obtained at least by the followingtwo modes.

Mode 1: synthesize the video image after performing a speed changeprocessing and the audio played at a constant speed as a target video.

Step c1: record the captured video image at a constant speed to obtain afirst video.

Step c2: divide the first video into a first video segment and a secondvideo segment; the first video segment represents a segment of the firstvideo before the target object presents the target state; and the secondvideo segment represents a segment of the second video after the targetobject presents the target state.

Step c3: perform speed change processing on the second video segmentaccording to the second speed change mode to obtain a third videosegment.

Step c4: synthesize the audio in a spliced video of the first videosegment and the third video segment to obtain the target video.

In this embodiment, firstly, the captured video image is recorded at aconstant speed to obtain the first video, and then the first video isdivided into a segment (i.e., the first video segment) before the targetobject presents the target state and a segment (i.e., the second videosegment) after the target object presents the target state. The firstvideo segment is saved unchanged, and for the second video segment, thesecond video segment is processed by adding or subtracting a frameaccording to a second speed change mode, to obtain a third videosegment, where the third video segment is a video image after speedchange processing. Then the saved first video segment is spliced withthe third video segment obtained by speed change processing to obtain aspliced video, and the audio recorded and saved at a normal speed issynthesized with the spliced video to obtain a target video. Since it isbackground processing, rather than real-time processing, the speedchange processing that differs from speed change processing with theduration in prior art can be supported and the operation is simplewithout importing other editing software, thus improving the userexperience.

Mode 2: splice the video image after speed change processing with theaudio after speed change processing to generate the target video.

Step d1: record a played video image and a played audio to obtain asecond video in the process of capturing a video; and in the secondvideo, play the audio before the target object presents the target stateat a constant speed, and play the audio after the target object presentsthe target state at a changing speed according to a first speed changemode, where the captured video image is played at a constant speed.

Step d2: divide the second video into a third video segment and a fourthvideo segment; where the third video segment represents a segment of thesecond video before the target object presents the target state; and thefourth video segment represents a segment of the second video after thetarget object presents the target state.

Step d3: perform speed change processing on the fourth video segmentaccording to the second speed change mode to obtain a fifth videosegment; and splice the third video segment and the fifth video segmentto obtain the target video.

In this embodiment, in the process of capturing a video, the playedvideo image and audio are recorded to obtain the second video, and therecorded audio before the target object presents the target state isplayed at a constant speed, the recorded audio after the target objectpresents the target state is played at a changing speed according to thefirst speed change mode, and the recorded video image is played at aconstant speed. Where the second video is divided into a segment (i.e.,the third video segment) before the target object presents the targetstate and a segment (i.e., the fourth video segment) after the targetobject presents the target state.

Specifically, the third video segment is saved, and speed changeprocessing is performed on the fourth video segment according to secondspeed change mode, i.e., the image video in the fourth video segment isprocessed by adding or subtracting a frame, and the audio in the fourthvideo segment (i.e., the audio played at a changing speed based on firstspeed change mode and recorded) is played in a second speed change mode,such that the audio is played at a constant speed. Splice the savedthird video segment with the speed change processed fifth video segmentto obtain the target video. Since it is background processing, ratherthan real-time processing, the speed change processing that differs fromspeed change processing with the duration in prior art can be supportedand the operation is simple without importing other editing software,thus improving the user experience.

In order to ensure the normal speed of the audio played through thefirst speed change mode, the second speed change mode with the oppositespeed change direction and the same speed as those of the first speedchange mode can be recorded first to obtain the audio processed by thefirst speed change mode. For example, if the first speed change is a 2xslow speed, the second speed change is a 2x fast speed. Then the audioprocessed by the first speed change mode is subjected to speed changeprocessing according to the second speed change mode to obtain the audioplayed at normal speed, i.e., the audio played in the target video.

Specifically, the terminal device acquires the currently played targetmusic when the speed change recording is triggered in the capturingprocess, and then performs speed change recording on the audio afterpresenting the target state according to the determined second speedchange mode, so as to generate the speed change audio which does notcontain the original sound upon capturing in the capturing interface.

The starting time of the audio recorded by the first speed change modeis a time when the target state is presented in response to therecognition (i.e., a time when the target state is presented in responseto the recognized target object); and the ending time of the audiorecorded by the first speed change mode is a time corresponding to theprogress of playing music when the capturing interface captures thetarget object at the end of capturing (i.e., a time when the speedchange recording is complete).

The rate of the speed change includes slow speed, fast speed and normalspeed, and the speed is any value between 0.333-3, and the speed changecan be represented by speed multiplying power. Exemplarily, taking thefirst speed change as 2x slow speed as an example, for example, thevideo image recorded at the normal speed is 30 frames/s, according tothe rate of the 2x slow speed, the corresponding subtracted frameoperation is carried out on the video image, for example, the recordedvideo image is recorded at 2 s for 30 frames, so that the playing rateof each frame of video image becomes slower upon playing.

In an embodiment of the present disclosure, based on the aboveembodiments, after the capturing function is turned on, the videoprocessing method may further include:

-   in response to the second triggering operation for instructing to    end capturing, resetting at least one of the following: the first    speed change mode, the second speed change mode, a state of special    effect editing effect, and a state of the capturing interface.

In the embodiments of the present disclosure, the user can click anending button used to indicate the end of capturing in the capturinginterface. When the terminal device detects the triggering operationacting on the end button, i.e., the second triggering operation, inresponse to the second triggering operation, the terminal device sends amessage of resetting the recording speed, controls the VE to reset stateand the client to reset state, and then updates the capturing state.Specifically, the first speed change mode and/or the second speed changemode and/or the state of the special effect editing effect and/or thestate of the capturing interface are reset.

Specifically, the specific procedure of the user clicking startcapturing/stop special effect editing effect is as follows: the terminaldevice first judges whether the capturing is started, if the capturingis started, updates the capturing state and end the started capturingstate; and if the capturing is not started, judges whether the capturingis ended. If the capturing is ended, a message resetting the recordingspeed is sent to control the VE to reset state and the client to resetstate, so as to update the capturing state and end the capturing state.

In an embodiment of the present disclosure, based on the aboveembodiments, the video processing method is explained in detail. Afterjumping to display the playing interface, the video processing methodcan also be realized by the following steps.

Step e1: control the playing interface to return to the capturinginterface containing a captured target object in response to a firstreturn triggering operation.

Step e2: if an instruction to indicate a recapture is received, resetthe content of the capturing interface containing the captured targetobject and exit the capturing interface according to the instructionsfor indicating a recapture.

In the embodiments of the present disclosure, after jumping to displaythe playing interface, the recorded video, i.e., the target video, isplayed on the display interface. When the user wants to return to thecapturing interface for re-editing, re-capturing or continuouscapturing, the user can click the first return button on the displayinterface, in response to the first return triggering operation actingon the first return button, the terminal device controls the playinginterface to return to the capturing interface containing the capturedtarget object, i.e., jumps to display the capturing interface in a pausecapturing state. The user can choose to cancel the current capturingcontent in the current capturing interface, i.e., the user can click thesecond back button on the current capturing interface, generate aninstruction for re-capturing, and send the instruction to the terminaldevice, the terminal device receives the instruction, resets the contentof the capturing interface and exits the current capturing interfaceaccording to the instruction. It is convenient for the user tore-capture, and the interactive operation is simple.

In an embodiment of the present disclosure, based on the aboveembodiments, the video processing method is explained in detail. Thevideo processing method may further include the following steps.

Step f1: after controlling the playing interface to return to thecapturing interface containing a captured target object or when thecapturing interface receives an instruction to suspend capturing uponcapturing the target object in the capturing interface, detect whetherto the progress bar of capturing time on the current capturing interfacereaches the ending position.

Step f2: if the ending position is not reached, when an instruction forinstructing to perform the capturing operation on the capturinginterface is received, control to continue performing the capturingoperation on the capturing interface.

In the embodiments of the present disclosure, if the capturing time inthe capturing interface is not over yet, the user can continue capturingin the current capturing interface immediately after the originalcapturing content. It can be realized by at least two scenarios asfollows.

Scenario 1: the current interface is in the playing interface, and theuser can click the first return button in the playing interface to makethe terminal device to return the playing interface to the capturinginterface in the pause capturing state. In this state, the progress barof the capturing time is displayed on the capturing interface, andwhether capturing can be continued is determined by detecting whetherthe progress bar of the capturing time on the capturing interfacereaches the ending position. If the ending position is not reached, theuser can click the capturing button on the capturing interface in thisstate, and the terminal device receives an instruction for instructingto perform capturing operation on the capturing interface, and continuesperforming the capturing operation on the capturing interface accordingto the instruction.

Scenario 2: the current interface is in the capturing interface, and thecapturing interface is in the capturing pause state in response to theinstruction input by the user to indicate a capturing pause. In thisstate, the progress bar of the capturing time is displayed on thecapturing interface, and whether capturing can be continued isdetermined by detecting whether the progress bar of the capturing timeon the capturing interface, in this state, reaches the ending position.If the ending position is not reached, the user can click the capturingbutton on the capturing interface in this state, and the terminal devicereceives an instruction for instructing to perform capturing operationon the capturing interface, and continues performing capturing operationon the capturing interface according to the instruction.

Thus, the present disclosure aims to reduce the threshold for the userto capture a “slow-motion transformation” class of video, to cause theuser to record the slow-motion video more than 1 s or record fast motionthrough special effect editing effect, and solve the problem of a musicand image mismatch, meanwhile in a special effect package, a logic to“recognize a certain triggering condition, and allow the recording tochange a speed” is added, and the speed change playing event is passedto the audio and video terminal and the client in real time by thespecial effect to perform speed change recording in the audio and videoterminal, enabling the user to more simply produce a higher qualityslow-motion video or a higher quality fast-motion video, thus improvingthe user experience.

FIG. 7 is a structural block diagram of a video processing apparatusprovided by an embodiment of the present disclosure, corresponding tothe video processing method of the embodiment disclosed above. Where thevideo processing apparatus can be a terminal device. For the convenienceof explanation, only parts related to the embodiment of the presentdisclosure are shown. Referring to FIG. 7 , the video processingapparatus 70 may include: a capturing module 701, a processing module702 and a playing module 703. The capturing module 701 is configured toplay an audio in a process of capturing a video; the processing module702 is configured to play a first part of the audio at a changing speedaccording to a first speed change mode in response to recognizing that atarget object presents a target state in a captured video image; wherethe first part of the audio represents an unplayed part of the audiowhen it is recognized that the target object presents the target statein the captured video image; the processing module 702 is furtherconfigured to obtain a target video according to the captured videoimage and the audio; and the playing module 703 is configured to play,in the target video, a video image captured before the target objectpresents the target state at a constant speed, and play, in the targetvideo, a video image captured after the target object presents thetarget state at a changing speed according to a second speed changemode, where the audio is played at a constant speed; and where a speedchange direction of the first speed change mode is opposite from a speedchange direction of the second speed change mode and a changing speed ofthe first speed change mode is the same as a changing speed of thesecond speed change mode.

The capturing module 701, the processing module 702, and the playingmodule 703 provided by the embodiments of the present disclosure playthe video in the process of capturing a video, then play a first part ofthe audio at a changing speed according to a first speed change mode inresponse to recognizing that a target object presents a target state ina captured video image, where the first part here represents an unplayedpart of the audio when it is recognized that the target object presentsthe target state in the captured video image; then obtain a target videoaccording to the captured video image and the audio, play, in the targetvideo, a video image captured before the target object presents thetarget state at a constant speed, and play, in the target video, a videoimage captured after the target object presents the target state at achanging speed according to a second speed change mode, where the audiois played at a constant speed, and where a speed change direction of thefirst speed change mode is opposite from a speed change direction of thesecond speed change mode and a changing speed of the first speed changemode is the same as a changing speed of the second speed change mode.Therefore, it is ensured that the audio is played at a constant speed inthe target video, while a part of the video image is played at aconstant speed and another part of the video image is played at achanging speed, thereby realizing the speed change recording and playingof the captured video. The present application can start the operationof controlling the video speed change recording directly by recognizingthe target state presented by the target object, without the need toimport the captured video into the video editing software or the editingpage for a corresponding slow playback process or a fast playbackprocess. In order to improve the user experience and make a user have astrong perception for the speed change recording, after the speed changerecording is triggered in the video capturing process, the played audiois played at a changing speed according to the first speed change mode,while in the process of playing the target video generated aftercapturing, for the video captured after the speed change is triggered,the video image is played at a changing speed according to the secondspeed change mode, while the audio is played at a constant speed. Thus,the personalized needs of users are satisfied, and the processingprocedure is simple and the operation steps are simplified, causing theuser to realize the speed change processing of the video throughconvenient operation, thereby improving the user experience.

The apparatus provided by the embodiment of the present disclosure canbe used to perform the technical solution of the video processing methodembodiment described in the first aspect above, and their implementationprinciples and technical effects are similar, and the embodiment of thepresent disclosure will not be repeated here.

In an embodiment of the present disclosure, the embodiment of thepresent disclosure describes the video processing apparatus in detail onthe basis of the above disclosed embodiments. Where the target state hasa mapping relationship with the first speed change mode; the processingmodule 702 is specifically configured to: determine the first speedchange mode having the mapping relationship with the target state inresponse to recognizing that the target object presents the target statein the captured video image; and play the first part of the audio at achanging speed according to the first speed change mode.

In an embodiment of the present disclosure, the target state includes astate in which the target object presents a target action and/or apreset object is presented in the target object.

In an embodiment of the present disclosure, the processing module 702 isspecifically configured to: record the captured video image at aconstant speed to obtain a first video; divide the first video into afirst video segment and a second video segment; the first video segmentrepresents a segment of the first video before the target objectpresents the target state; and the second video segment represents asegment of the second video after the target object presents the targetstate; perform speed change processing on the second video segmentaccording to the second speed change mode to obtain a third videosegment; and synthesize the audio in a spliced video of the first videosegment and the third video segment to obtain the target video.

In an embodiment of the present disclosure, the processing module 702 isfurther specifically configured to: record a played video image and aplayed audio to obtain a second video in the process of capturing avideo; and in the second video, play the audio before the target objectpresents the target state at a constant speed, and play the audio afterthe target object presents the target state at a changing speedaccording to a first speed change mode, where the captured video imageis played at a constant speed; divide the second video into a thirdvideo segment and a fourth video segment; where the third video segmentrepresents a segment of the second video before the target objectpresents the target state; and the fourth video segment represents asegment of the second video after the target object presents the targetstate; and perform speed change processing on the fourth video segmentaccording to the second speed change mode to obtain a fifth videosegment; and splice the third video segment and the fifth video segmentto obtain the target video.

In an embodiment of the present disclosure, the second speed change modeis determined in response to a setting operation of a user, and thefirst speed change mode is determined based on the second speed changemode.

In an embodiment of the present disclosure, the processing module 702 isfurther configured to: in response to the second triggering operationfor instructing to end capturing, reset at least one of the following:the first speed change mode, the second speed change mode, a state ofspecial effect editing effect, and a state of the capturing interface.

In an embodiment of the present disclosure, the playing module 703 isfurther configured to: before playing the target video, jump to displaya playing interface in response to a second triggering operation forindicating to end capturing, where the playing interface is configuredto play the target video.

In an embodiment of the present disclosure, the processing module 702 isfurther configured to: after the jumping to display the playinginterface, control the playing interface to return to the capturinginterface containing a captured target object in response to a firstreturn triggering operation; and when an instruction to indicate arecapture is received, reset the content of the capturing interfacecontaining the captured target object and exit the capturing interfaceaccording to the instructions for indicating a recapture.

In an embodiment of the present disclosure, the processing module 702 isfurther configured to: after controlling the playing interface to returnto the capturing interface containing a captured target obj ect or whenthe capturing interface receives an instruction to suspend capturingupon capturing the target object in the capturing interface, detectwhether to the progress bar of capturing time on the current capturinginterface reaches the ending position; and if the ending position is notreached, when an instruction for instructing to perform the capturingoperation on the capturing interface is received, control to continueperform the capturing operation on the capturing interface.

In an embodiment of the present disclosure, the playing module 703 isfurther configured to: before playing the target video, jump to displaythe playing interface after a capturing time in the capturing interfaceis over.

The above modules can be implemented as software components executed onone or more general-purpose processors, or as hardware, such asprogrammable logic devices and/or application-specific integratedcircuits, that perform certain functions or their combinations. In someembodiments, these modules can be embodied in the form of softwareproducts, which can be stored in non-volatile storage media, includingthose that enable computer devices (such as personal computers, servers,network devices, mobile terminals, etc.) to implement the methodsdescribed in the embodiments of the present invention. In an embodiment,the above modules can also be implemented on a single device ordistributed on multiple devices. The functions of these modules can becombined with each other or further split into multiple sub-modules.

A person skilled in the art can clearly understand that for theconvenience and brevity of the description, specific working processesof a video processing apparatus described above can refer to thecorresponding processes in the foregoing method embodiments, and detailsare not described herein again.

Based on the same inventive concept as the method, an embodiment of thepresent application further provides an electronic device, including aprocessor and a memory;

the memory is configured to store a program for executing the methodsdescribed in the above-mentioned various method embodiments; and theprocessor is configured to execute the program stored in the memory.

The electronic device can be a terminal device. The electronic deviceincludes a processor and a memory; the memory is configured to store aprogram for executing the methods described in the above-mentionedvarious method embodiments; the processor is configured to execute theprogram stored in the memory. The processor may be a processing unitincluding a central processing unit (CPU) or other forms with dataprocessing capability and/or instruction execution capability, and maycontrol other components in the electronic device to perform desiredfunctions. The memory may include one or more computer program products,which may include various forms of computer readable storage media, suchas volatile memories and/or nonvolatile memories. The volatile memoriesmay include, for example, a random access memory (RAM) and/or a cache.The nonvolatile memories may include, for example, a read only memory(ROM), a hard disk, a flash memory, etc. One or more computer programinstructions may be stored on the computer readable storage medium, andthe processor may run the program instructions to realize the functionsof the embodiments of the present disclosure described above and/orother desired functions.

The terminal device may include, but is not limited to: a mobileterminal such as a mobile phone, a notebook computer, a digitalbroadcast receivers, a personal digital assistant (PDA), a table personcomputer, a portable media player (PMP), and an on-vehicle terminal(e.g., an on-vehicle navigation terminal), a wearable electronic device,and a stationary terminal such as a digital TV and a desktop computer,etc. The electronic device shown in FIG. 8 is only an example, whichshould not bring any limitation to the function and the use range of theembodiments of the present disclosure.

Referring to FIG. 8 , it shows a schematic structural diagram of anelectronic device suitable for implementing the embodiment of thepresent disclosure. As shown in FIG. 8 , the electronic device mayinclude a processing apparatus (such as a central processing unit, agraphics processor, etc.) 801, which may perform various appropriateactions according to a program stored in a read only memory (ROM) 802 ora program loaded into a random access memory (RAM) 803 from a storageapparatus 808. In the RAM 803, various programs and data required forthe operation of the electronic device are also stored. The processingapparatus 801, the ROM 802, and the RAM 803 are connected to each otherthrough a bus 804. An input/output (I/O) interface 805 is also connectedto the bus 804.

Usually, the following apparatuses can be connected to the I/O interface805: an input apparatus 806 including, such as, a touch screen, atouchpad, a keyboard, a mouse, a camera, a microphone, an accelerometer,a gyroscope, etc.; an output apparatus 807 including, such as, a liquidcrystal display (LCD), a speaker, a vibrator, etc.; the storageapparatus 808 including, such as, a magnetic tape, a hard disk, etc.;and a communication apparatus 809. The communication apparatus 809 mayallow the electronic device to perform wireless or wired communicationwith other devices to exchange data. Although FIG. 8 shows an electronicdevice having various apparatuses, it should be understood that it isnot required to implement or have all of the apparatuses illustrated.More or fewer apparatuses may alternatively be implemented or provided.

In particular, according to the embodiments of the present disclosure,the process described above with reference to the flow chart can beimplemented as a computer software program. For example, an embodimentof the present disclosure includes a computer program product includinga computer program carried on a computer readable medium, where thecomputer program contains program codes for executing the method shownin the flow chart. In such embodiment, the computer program can bedownloaded and installed from a network through the communicationapparatus 809, or installed from the storage apparatus 808 or installedfrom the ROM 802. When the computer program is executed by theprocessing apparatus 801, the above functions defined in the methods ofthe embodiments of the present disclosure are executed. The embodimentsof the present disclosure also include a computer program which, whenexecuted by a processor, executes the above functions defined in themethod of the embodiments of the present disclosure.

It should be noted that the above computer readable medium in thepresent disclosure can be a computer readable signal medium or acomputer readable storage medium or a combination of the two. Thecomputer readable storage medium can be, for example, but not limitedto, an electrical, a magnetic, an optical, an electromagnetic, aninfrared, or a semiconductor system, apparatus, or device, or anycombination of the above. More specific examples of computer-readablestorage media may include, but are not limited to, electricalconnections with one or more wires, portable computer disks, hard disks,random access memories (RAM), read only memories (ROM), erasableprogrammable read-only memories (EPROM), flash memories, optical fibers,compact disk-read only memories (CD-ROM), optical storage devices, andmagnetic storage devices, or any suitable combination of the above. Inthe present disclosure, the computer readable storage medium can be anytangible medium containing or storing a program, where the program canbe used by or in connection with an instruction execution system,apparatus or device. And in the present disclosure, the computerreadable signal medium may include a data signal propagated in basebandor as a part of a carrier wave, in which computer readable program codesare carried. This propagated data signal can adopt various manners,including but not limited to an electromagnetic signal, an opticalsignal or any suitable combination of the above. The computer readablesignal medium can also be any computer readable medium other than acomputer readable storage medium, and the computer readable signalmedium can send, propagate or transmit a program for use by or inconnection with an instruction execution system, apparatus or device.The program code contained on the computer readable medium can betransmitted by any suitable medium, including, but not limited to, anelectric wire, an optical cable, radio frequency (RF), etc., or anysuitable combination of the above.

The above computer readable medium may be contained in the aboveelectronic device; or may exist separately instead of being assembledinto the electronic device.

The above computer readable medium carries one or more programs, and theone or more programs, when executed by the electronic device, cause theelectronic device to execute the methods shown in the above disclosedembodiments.

The computer program codes for executing the operations of the presentdisclosure can be written in one or more programming languages or acombination thereof, where the above programming languages includeobject-oriented programming languages such as Java, Smalltalk, C++, andalso conventional procedural programming languages such as “C” languageor similar programming languages. The program codes can be completelyexecuted on the user computer, partially executed on the user computer,executed, as an independent software package, partially executed on theuser computer and partially executed on a remote computer, or completelyexecuted on a remote computer or a server. In case of involving a remotecomputer, the remote computer can be connected to the user computerthrough any kind of networks, including a local area network (LAN) or awide area network (WAN), or can be connected to an external computer(for example, by using Internet service providers to connect through theInternet).

The flow charts and block diagrams in the drawings illustrate thearchitecture, functions and operations of possible implementations ofsystems, methods and computer program products according to variousembodiments of the present disclosure. In this regard, each block in theflow charts or block diagrams may represent a module, a program segment,or a part of codes, where the module, the program segment, or the partof codes contains one or more executable instructions for implementingspecified logical functions. It should also be noted that, in somealternative implementations, the functions indicated in the blocks mayalso occur in a different order than those indicated in the drawings.For example, two blocks shown one after the other can actually beexecuted in parallel substantially, or they may sometimes be executed inan opposite order, depending on functions involved. It should also benoted that each block in the block diagrams and/or flow charts, andcombinations of blocks in the block diagrams and/or flow charts, can beimplemented with a dedicated hardware-based system that performsspecified functions or operations, or can be implemented withcombinations of dedicated hardware and computer instructions.

An involved unit described in the embodiments of the present disclosurecan be implemented by software or hardware. Where a name of the unitdoes not constitute a limitation on the unit itself in some cases. Forexample, a first acquiring unit can also be described as “a unit foracquiring at least two Internet protocol addresses”.

The above functions described herein may be performed, at least in part,by one or more hardware logic components. For example, withoutlimitation, exemplary types of hardware logic components that can beused include: a field programmable gate array (FPGA), an applicationspecific integrated circuit (ASIC), an application specific standardproduct (ASSP), a system on chip (SOC), a complex programmable logicdevice (CPLD), etc.

In the context of the present disclosure, a machine readable medium maybe a tangible medium that may contain or store a program for use by orin connection with an instruction execution system, apparatus or device.The machine readable medium may be a machine readable signal medium or amachine readable storage medium. The machine readable medium mayinclude, but is not limited to, an electronic, a magnetic, an optical,an electromagnetic, an infrared, or a semiconductor system, apparatus,or device, or any suitable combination of the above contents. Morespecific examples of the machine readable medium may include anelectrical connection based on one or more wires, a portable computerdisk, a hard disk, a random access memory (RAM), a read only memory(ROM), an erasable programmable read only memory (EPROM), a flashmemory, an optical fiber, a portable compact disc read only memory(CD-ROM), an optical storage device, a magnetic storage device, or anysuitable combination of the above contents.

An embodiment of the present disclosure provides a computer programproduct, including a computer program, which is characterized in thatwhen executed by a processor, the computer program realizes the videoprocessing method as described in the first aspect above.

In a first aspect, an embodiment of the present disclosure provides avideo processing method, including:

-   playing an audio in a process of capturing a video;-   playing a first part of the audio at a changing speed according to a    first speed change mode in response to recognizing that a target    object presents a target state in a captured video image; where the    first part of the audio represents an unplayed part of the audio    when it is recognized that the target object presents the target    state in the captured video image;-   obtaining a target video according to the captured video image and    the audio;-   playing, in the target video, a video image captured before the    target object presents the target state at a constant speed, and    playing, in the target video, a video image captured after the    target object presents the target state at a changing speed    according to a second speed change mode, where the audio is played    at a constant speed; and-   where a speed change direction of the first speed change mode is    opposite from a speed change direction of the second speed change    mode and a changing speed of the first speed change mode is the same    as a changing speed of the second speed change mode.

According to one or more embodiments of the present disclosure, thetarget state has a mapping relationship with the first speed changemode;

-   playing the unplayed part of the audio at the changing speed    according to the first speed change mode in response to recognizing    that the target object presents the target state in the captured    video image includes:-   determining the first speed change mode having the mapping    relationship with the target state in response to recognizing that    the target object presents the target state in the captured video    image; and-   playing the first part of the audio at a changing speed according to    the first speed change mode.

According to one or more embodiments of the present disclosure, thetarget state includes a state in which the target object presents atarget action and/or a preset object is presented in the target object.

According to one or more embodiments of the present disclosure, theobtaining the target video according to the captured video image and theaudio includes:

-   recording the captured video image at a constant speed to obtain a    first video;-   dividing the first video into a first video segment and a second    video segment; the first video segment represents a segment of the    first video before the target object presents the target state; and    the second video segment represents a segment of the second video    after the target object presents the target state;-   performing speed change processing on the second video segment    according to the second speed change mode to obtain a third video    segment; and-   synthesizing the audio in a spliced video of the first video segment    and the third video segment to obtain the target video.

According to one or more embodiments of the present disclosure, theobtaining the target video according to the captured video image and theaudio includes:

-   recording a played video image and a played audio to obtain a second    video in the process of capturing a video; and in the second video,    playing the audio before the target object presents the target state    at a constant speed, and playing the audio after the target object    presents the target state at a changing speed according to a first    speed change mode, where the captured video image is played at a    constant speed;-   dividing the second video into a third video segment and a fourth    video segment; where the third video segment represents a segment of    the second video before the target object presents the target state;    and the fourth video segment represents a segment of the second    video after the target object presents the target state; and-   performing speed change processing on the fourth video segment    according to the second speed change mode to obtain a fifth video    segment; and splicing the third video segment and the fifth video    segment to obtain the target video.

According to one or more embodiments of the present disclosure, thesecond speed change mode is determined in response to a settingoperation of a user, and the first speed change mode is determined basedon the second speed change mode.

In a second aspect, an embodiment of the present disclosure provides avideo processing apparatus, where the apparatus includes:

-   a capturing module, configured to play an audio in a process of    capturing a video;-   a processing module, configured to play a first part of the audio at    a changing speed according to a first speed change mode in response    to recognizing that a target object presents a target state in a    captured video image; where the first part of the audio represents    an unplayed part of the audio when it is recognized that the target    object presents the target state in the captured video image;-   the processing module is further configured to obtain a target video    according to the captured video image and the audio; and-   a playing module, configured to play, in the target video, a video    image captured before the target object presents the target state at    a constant speed, and play, in the target video, a video image    captured after the target object presents the target state at a    changing speed according to a second speed change mode, where the    audio is played at a constant speed;-   where a speed change direction of the first speed change mode is    opposite from a speed change direction of the second speed change    mode and a changing speed of the first speed change mode is the same    as a changing speed of the second speed change mode.

According to one or more embodiments of the present disclosure, thetarget state has a mapping relationship with the first speed changemode; the processing module is specifically configured to: determine thefirst speed change mode having the mapping relationship with the targetstate in response to recognizing that the target object presents thetarget state in the captured video image; and play the first part of theaudio at a changing speed according to the first speed change mode.

According to one or more embodiments of the present disclosure, thetarget state includes a state in which the target object presents atarget action and/or a preset object is presented in the target object.

According to one or more embodiments of the present disclosure, theprocessing module is specifically configured to: record the capturedvideo image at a constant speed to obtain a first video; divide thefirst video into a first video segment and a second video segment; thefirst video segment represents a segment of the first video before thetarget object presents the target state; and the second video segmentrepresents a segment of the second video after the target objectpresents the target state; perform speed change processing on the secondvideo segment according to the second speed change mode to obtain athird video segment; and synthesize the audio in a spliced video of thefirst video segment and the third video segment to obtain the targetvideo.

According to one or more embodiments of the present disclosure, theprocessing module is further specifically configured to: record a playedvideo image and a played audio to obtain a second video in the processof capturing a video; and in the second video, play the audio before thetarget object presents the target state at a constant speed, and playthe audio after the target object presents the target state at achanging speed according to a first speed change mode, where thecaptured video image is played at a constant speed; divide the secondvideo into a third video segment and a fourth video segment; where thethird video segment represents a segment of the second video before thetarget object presents the target state; and the fourth video segmentrepresents a segment of the second video after the target objectpresents the target state; and perform speed change processing on thefourth video segment according to the second speed change mode to obtaina fifth video segment; and splice the third video segment and the fifthvideo segment to obtain the target video.

According to one or more embodiments of the present disclosure, thesecond speed change mode is determined in response to a settingoperation of a user, and the first speed change mode is determined basedon the second speed change mode.

In a third aspect, an embodiment of the present disclosure provides anelectronic device, including: at least one processor and a memory;

-   the memory stores computer-executed instructions;-   where the at least one processor executes the computer-executed    instructions stored in the memory to cause the at least one    processor to execute the video processing method as described in any    one of the above first aspect.

In a fourth aspect, an embodiment of the present disclosure provides acomputer readable storage medium storing computer-executed instructionswhich, when executed by a processor, implement the video processingmethod as described in any one of the above first aspect.

In a fifth aspect, an embodiment of the present disclosure provides acomputer program product including a computer program which, whenexecuted by a processor, implements the video processing method asdescribed in any one of the above first aspect.

In a sixth aspect, an embodiment of the present disclosure provides acomputer program which, when executed by a processor, implements thevideo processing method as described in any one of the above firstaspect.

The above description is only a description of preferred embodiments ofthe present disclosure and applied technical principles. It should beunderstood by those skilled in the art that the disclosure scopeinvolved in the present disclosure is not limited to the technicalsolution formed by the specific combination of the above technicalfeatures, but also covers other technical solutions formed by anycombination of the above technical features or equivalent featuresthereof without departing from the above disclosure concept. Forexample, the technical solution formed by replacing the above featureswith the technical features with similar functions disclosed in (but notlimited to) the present disclosure.

In addition, although the operations are depicted in a particular order,this should not be understood as requiring that the operations areperformed in the particular order shown or in a sequential order. Undercertain circumstances, multitasking and parallel processing may beadvantageous. Similarly, although several specific implementationdetails are included in the above discussion, these should not beinterpreted as limitations on the scope of the present disclosure.Certain features described in the context of separate embodiment mayalso be implemented in a single embodiment in combination. On thecontrary, various features described in the context of a singleembodiment can also be implemented in multiple embodiments individuallyor in any suitable sub-combination.

Although the subject matter has been described in a language specific tostructural features and/or logical acts of methods, it should beunderstood that the subject matter defined in the appended claims is notnecessarily limited to the specific features or acts described above. Onthe contrary, the specific features and actions described above aremerely exemplary forms for implementing the claims.

What is claimed is:
 1. A video processing method, wherein the methodcomprises: playing an audio in a process of capturing a video; playing afirst part of the audio at a changing speed according to a first speedchange mode in response to recognizing that a target object presents atarget state in a captured video image; wherein the first part of theaudio represents an unplayed part of the audio when it is recognizedthat the target object presents the target state in the captured videoimage; obtaining a target video according to the captured video imageand the audio; and playing, in the target video, a video image capturedbefore the target object presents the target state at a constant speed,and playing, in the target video, a video image captured after thetarget object presents the target state at a changing speed according toa second speed change mode, wherein the audio is played at a constantspeed; and wherein a speed change direction of the first speed changemode is opposite from a speed change direction of the second speedchange mode and a changing speed of the first speed change mode is thesame as a changing speed of the second speed change mode.
 2. The methodaccording to claim 1, wherein the target state has a mappingrelationship with the first speed change mode; playing the unplayed partof the audio at the changing speed according to the first speed changemode in response to recognizing that the target object presents thetarget state in the captured video image comprises: determining thefirst speed change mode having the mapping relationship with the targetstate in response to recognizing that the target object presents thetarget state in the captured video image; and playing the first part ofthe audio at a changing speed according to the first speed change mode.3. The method according to claim 1, wherein the target state comprisesat least one of a state in which the target object presents a targetaction and a state in which a preset object is presented in the targetobject.
 4. The method according to claim 1, wherein the obtaining thetarget video according to the captured video image and the audiocomprises: recording the captured video image at a constant speed toobtain a first video; dividing the first video into a first videosegment and a second video segment; the first video segment represents asegment of the first video before the target object presents the targetstate; and the second video segment represents a segment of the secondvideo after the target object presents the target state; performingspeed change processing on the second video segment according to thesecond speed change mode to obtain a third video segment; andsynthesizing the audio in a spliced video of the first video segment andthe third video segment to obtain the target video.
 5. The methodaccording to claim 1, wherein the obtaining the target video accordingto the captured video image and the audio comprises: recording a playedvideo image and a played audio to obtain a second video in the processof capturing a video; and in the second video, playing the audio beforethe target object presents the target state at a constant speed, andplaying the audio after the target object presents the target state at achanging speed according to a first speed change mode, wherein thecaptured video image is played at a constant speed; dividing the secondvideo into a third video segment and a fourth video segment; wherein thethird video segment represents a segment of the second video before thetarget object presents the target state; and the fourth video segmentrepresents a segment of the second video after the target objectpresents the target state; and performing speed change processing on thefourth video segment according to the second speed change mode to obtaina fifth video segment; and splicing the third video segment and thefifth video segment to obtain the target video.
 6. The method accordingto claim 1, wherein the second speed change mode is determined inresponse to a setting operation of a user, and the first speed changemode is determined based on the second speed change mode.
 7. The methodaccording to claim 1, wherein before the video capturing process, themethod further comprises: displaying a capturing interface and startinga capturing function, comprising: displaying the capturing interface forcapturing the target object, wherein the capturing interface comprises aspecial effect editing button for supporting and controlling video speedchange recording; and in response to a first triggering operation actingon the special effect editing button, upon receiving an instruction forinstructing to perform a capturing operation on the capturing interface,performing a first capturing operation, and recognizing whether thetarget object presents the target state at the time of capturing thetarget object in the capturing interface.
 8. The method according toclaim 1, wherein the method further comprises: jumping to display aplaying interface in response to a second triggering operation forindicating to end capturing, wherein the playing interface is used toplay the target video; or, jumping to display the playing interfaceafter a capturing time in the capturing interface is over; and playingthe target video.
 9. The method according to claim 8, wherein after thecapturing function is turned on, the method further comprises: inresponse to the second triggering operation for instructing to end thecapturing, resetting at least one of the following: the first speedchange mode, the second speed change mode, a state of special effectediting effect, and a state of the capturing interface.
 10. The methodaccording to claim 8, wherein after jumping to display the playinginterface, the method further comprises: controlling the playinginterface to return to the capturing interface containing a capturedtarget object in response to a first return triggering operation.
 11. Anelectronic device, comprising: at least one processor; and a memory;wherein the memory stores computer-executed instructions; and the atleast one processor executes the computer-executed instructions storedin the memory to cause the at least one processor to: play an audio in aprocess of capturing a video; play a first part of the audio at achanging speed according to a first speed change mode in response torecognizing that a target object presents a target state in a capturedvideo image; wherein the first part of the audio represents an unplayedpart of the audio when it is recognized that the target object presentsthe target state in the captured video image; obtain a target videoaccording to the captured video image and the audio; and play, in thetarget video, a video image captured before the target object presentsthe target state at a constant speed, and play, in the target video, avideo image captured after the target object presents the target stateat a changing speed according to a second speed change mode, wherein theaudio is played at a constant speed; and wherein a speed changedirection of the first speed change mode is opposite from a speed changedirection of the second speed change mode and a changing speed of thefirst speed change mode is the same as a changing speed of the secondspeed change mode.
 12. The electronic device according to claim 11,wherein the at least one processor is configured to execute thecomputer-executed instructions in the memory to: determine the firstspeed change mode having the mapping relationship with the target statein response to recognizing that the target object presents the targetstate in the captured video image; and play the first part of the audioat a changing speed according to the first speed change mode.
 13. Theelectronic device according to claim 11, wherein the target statecomprises at least one of a state in which the target object presents atarget action and a state in which a preset object is presented in thetarget object.
 14. The electronic device according to claim 11, whereinthe at least one processor is configured to execute thecomputer-executed instructions in the memory to: record the capturedvideo image at a constant speed to obtain a first video; divide thefirst video into a first video segment and a second video segment; thefirst video segment represents a segment of the first video before thetarget object presents the target state; and the second video segmentrepresents a segment of the second video after the target objectpresents the target state; perform speed change processing on the secondvideo segment according to the second speed change mode to obtain athird video segment; and synthesize the audio in a spliced video of thefirst video segment and the third video segment to obtain the targetvideo.
 15. The electronic device according to claim 11, wherein the atleast one processor is configured to execute the computer-executedinstructions in the memory to: record a played video image and a playedaudio to obtain a second video in the process of capturing a video; andin the second video, play the audio before the target object presentsthe target state at a constant speed, and play the audio after thetarget object presents the target state at a changing speed according toa first speed change mode, wherein the captured video image is played ata constant speed; divide the second video into a third video segment anda fourth video segment; wherein the third video segment represents asegment of the second video before the target object presents the targetstate; and the fourth video segment represents a segment of the secondvideo after the target object presents the target state; and performspeed change processing on the fourth video segment according to thesecond speed change mode to obtain a fifth video segment; and splice thethird video segment and the fifth video segment to obtain the targetvideo.
 16. The electronic device according to claim 11, wherein thesecond speed change mode is determined in response to a settingoperation of a user, and the first speed change mode is determined basedon the second speed change mode.
 17. The electronic device according toclaim 11, wherein the at least one processor is configured to executethe computer-executed instructions in the memory to: display thecapturing interface for capturing the target object, wherein thecapturing interface comprises a special effect editing button forsupporting and controlling video speed change recording; and perform afirst capturing operation, and recognize whether the target objectpresents the target state at the time of capturing the target object inthe capturing interface in response to a first triggering operationacting on the special effect editing button, upon receiving aninstruction for instructing to perform a capturing operation on thecapturing interface.
 18. The electronic device according to claim 11,wherein the at least one processor is configured to execute thecomputer-executed instructions in the memory to: jump to display aplaying interface in response to a second triggering operation forindicating to end capturing, wherein the playing interface is used toplay the target video; or, jump to display the playing interface after acapturing time in the capturing interface is over ; and play the targetvideo.
 19. The electronic device according to claim 18, wherein afterthe capturing function is turned on, the at least one processor isconfigured to execute the computer-executed instructions in the memoryto: reset at least one of the following: the first speed change mode,the second speed change mode, a state of special effect editing effect,and a state of the capturing interface in response to the secondtriggering operation for instructing to end the capturing.
 20. Anon-transitory computer readable storage medium storingcomputer-executed instructions which, when executed by a processor,implement: play an audio in a process of capturing a video; play a firstpart of the audio at a changing speed according to a first speed changemode in response to recognizing that a target object presents a targetstate in a captured video image; wherein the first part of the audiorepresents an unplayed part of the audio when it is recognized that thetarget object presents the target state in the captured video image;obtain a target video according to the captured video image and theaudio; and play, in the target video, a video image captured before thetarget object presents the target state at a constant speed, and play,in the target video, a video image captured after the target objectpresents the target state at a changing speed according to a secondspeed change mode, wherein the audio is played at a constant speed; andwherein a speed change direction of the first speed change mode isopposite from a speed change direction of the second speed change modeand a changing speed of the first speed change mode is the same as achanging speed of the second speed change mode.